ELBlocker: Predicting blocking bugs with ensemble imbalance learning
نویسندگان
چکیده
http://dx.doi.org/10.1016/j.infsof.2014.12.006 0950-5849/ 2015 Elsevier B.V. All rights reserved. ⇑ Corresponding author. E-mail addresses: [email protected] (X. Xia), [email protected] (D. Lo), [email protected] (E. Shihab), [email protected] (X. Wang), yangxh@ zju.edu.cn (X. Yang). 1 In this paper, we use the terms ‘‘bug’’ or ‘‘bug report’’ interchangeabl refer to an issue report stored in a bug tracking system that is marked as a Xin Xia , David Lo , Emad Shihab , Xinyu Wang a,⇑, Xiaohu Yang a
منابع مشابه
Diversified Ensemble Classifiers for Highly Imbalanced Data Learning and their Application in Bioinformatics
In this dissertation, the problem of learning from highly imbalanced data is studied. Imbalance data learning is of great importance and challenge in many real applications. Dealing with a minority class normally needs new concepts, observations and solutions in order to fully understand the underlying complicated models. We try to systematically review and solve this special learning task in t...
متن کاملEnsemble diversity for class imbalance learning
This thesis studies the diversity issue of classification ensembles for class imbalance learning problems. Class imbalance learning refers to learning from imbalanced data sets, in which some classes of examples (minority) are highly under-represented comparing to other classes (majority). The very skewed class distribution degrades the learning ability of many traditional machine learning meth...
متن کاملCUSBoost: Cluster-based Under-sampling with Boosting for Imbalanced Classification
Class imbalance classification is a challenging research problem in data mining and machine learning, as most of the real-life datasets are often imbalanced in nature. Existing learning algorithms maximise the classification accuracy by correctly classifying the majority class, but misclassify the minority class. However, the minority class instances are representing the concept with greater in...
متن کاملEnsemble of online neural networks for non-stationary and imbalanced data streams
Concept drift (non-stationarity) and class imbalance are two important challenges for supervised classifiers. “Concept drift” (or non-stationarity) refers to changes in the underlying function being learnt, and class imbalance is a vast difference between the numbers of instances in different classes of data. Class imbalance is an obstacle for the efficiency of most classifiers. Research on cla...
متن کاملEnsemble-based active learning for class imbalance problem
In medical diagnosis, the problem of class imbalance is popular. Though there are abundant unlabeled data, it is very difficult and expensive to get labeled ones. In this paper, an ensemble-based active learning algorithm is proposed to address the class imbalance problem. The artificial data are created according to the distribution of the training dataset to make the ensemble diverse, and the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Information & Software Technology
دوره 61 شماره
صفحات -
تاریخ انتشار 2015